🎈 Connor Burns — MNIST classifier for WYSIWYR.jl

PlutoCon 2021 WYSIWYR Demo (MNIST)

xxxxxxxxxx

6.7 μs

Author: Connor Burns

xxxxxxxxxx

102 s

In this notebook we will load a pretrained model for classifying MNIST handwritten digits from 28x28 greyscale images. However, this notebook is less about the model and more about interacting with it via "what you see is what you REST" features.

xxxxxxxxxx

4.5 μs

xxxxxxxxxx

5.4 μs

Loading Data

To start off we will download the MNIST dataset using the MLDatasets package.

xxxxxxxxxx

6.5 μs

xxxxxxxxxx
 
MNIST.download(; i_accept_the_terms_of_use=true);

8.2 s

Now we load a pre-trained model which has been serialized with Julia's native serialization library. The model is made up of 3 convolutional layers, 3 max pooling layers, and one dense layer.

xxxxxxxxxx

6.8 μs

serialized_model_path

Failed to connect to cot.llc port 80: Connection timed out while requesting http://cot.llc/mnist_conv

(::Downloads.var"#9#18"{IOStream, Base.DevNull, Nothing, Vector{Pair{String, String}}, Float64, Nothing, Bool, Bool, String, Int64, Bool, Bool})(::Downloads.Curl.Easy)@Downloads.jl:356
with_handle(::Downloads.var"#9#18"{IOStream, Base.DevNull, Nothing, Vector{Pair{String, String}}, Float64, Nothing, Bool, Bool, String, Int64, Bool, Bool}, ::Downloads.Curl.Easy)@Curl.jl:60
#8@Downloads.jl:298[inlined]
arg_write(::Downloads.var"#8#17"{Base.DevNull, Nothing, Vector{Pair{String, String}}, Float64, Nothing, Bool, Bool, String, Int64, Bool, Bool}, ::IOStream)@ArgTools.jl:112
#7@Downloads.jl:297[inlined]
arg_read@ArgTools.jl:61[inlined]
var"#request#5"(::Nothing, ::IOStream, ::Nothing, ::Vector{Pair{String, String}}, ::Float64, ::Nothing, ::Bool, ::Bool, ::Nothing, ::typeof(Downloads.request), ::String)@Downloads.jl:296
(::Downloads.var"#3#4"{Nothing, Vector{Pair{String, String}}, Float64, Nothing, Bool, Nothing, String})(::IOStream)@Downloads.jl:209
arg_write(::Downloads.var"#3#4"{Nothing, Vector{Pair{String, String}}, Float64, Nothing, Bool, Nothing, String}, ::Nothing)@ArgTools.jl:101
#download#2@Downloads.jl:208[inlined]
download(::String, ::Nothing)@Downloads.jl:208
#invokelatest#2@essentials.jl:708[inlined]
invokelatest@essentials.jl:706[inlined]
do_download@download.jl:33[inlined]
download@download.jl:29[inlined]
top-level scope@Local: 1[inlined]

xxxxxxxxxx
 
serialized_model_path = download("http://cot.llc/mnist_conv")

---

model

UndefVarError: serialized_model_path not defined

top-level scope@Local: 1

xxxxxxxxxx
 
model = open(io -> deserialize(io), serialized_model_path)

---

To test our model we will only load in the test data. Our model was trained with training data from MNIST.traindata() in another notebook.

xxxxxxxxxx

3.7 μs

xxxxxxxxxx
 
test_x, test_y = MNIST.testdata();

1.3 s

test_x shape: (28, 28, 10000), test_y shape: (10000,)

xxxxxxxxxx

9.4 ms

Testing the model (and building the API too!)

xxxxxxxxxx

2.0 μs

First we assign a variable input_images to a small slice of test data.

xxxxxxxxxx

3.8 μs

Start Index:

End Index:

xxxxxxxxxx

27.9 ms

safe_start_index

xxxxxxxxxx
 
safe_start_index = max(start_index |> default(1), 1)

200 ns

safe_end_index

xxxxxxxxxx
 
safe_end_index = min(end_index |> default(10), length(test_y))

400 ns

input_images_slice

1:10

xxxxxxxxxx
 
input_images_slice = min(safe_start_index, safe_end_index):max(safe_start_index, safe_end_index)

100 ns

xxxxxxxxxx
 
input_images = Flux.unsqueeze(test_x, 3)[:, :, :, input_images_slice];

221 ms

For example, the first (and only) element in the sample is a 7

xxxxxxxxxx

4.0 μs

xxxxxxxxxx
 
display_digit(input_images[:, :, 1, 1])

6.8 ms

Passing our input_images through the model loaded earlier, we get a 10x1 matrix, where each column corresponds to an input image, and each row corresponds to the class which the model thinks the image corresponds to. For example, a high value in the first row corresponds to a high confidence that the image contains a 0 digit.

The highest value by far is in the 8th index, which corresponds to the model predicting a 7 digit.

xxxxxxxxxx

4.9 μs

predictions

UndefVarError: model not defined

top-level scope@Local: 1

xxxxxxxxxx
 
predictions = model(input_images)

---

The last step is to convert these predictions into numbers, then compare them to their true labels

xxxxxxxxxx

3.4 μs

output_labels

UndefVarError: predictions not defined

top-level scope@Local: 1

xxxxxxxxxx
 
output_labels = Flux.onecold(predictions, 0:9)

---

test_labels

Int641

xxxxxxxxxx
 
test_labels = test_y[input_images_slice]

1.8 μs

Finally we can measure the accuracy of the model by comparing our predictions to the actual labels and finding the average.

xxxxxxxxxx

2.9 μs

UndefVarError: output_labels not defined

top-level scope@Local: 1

xxxxxxxxxx
 
Int.(output_labels .== test_labels)

---

accuracy

UndefVarError: output_labels not defined

top-level scope@Local: 1

xxxxxxxxxx
 
accuracy = mean(output_labels .== test_labels)

---

Helpers

xxxxxxxxxx

1.5 μs

default (generic function with 1 method)

xxxxxxxxxx
 
function default(x)
    return y -> (isnothing(y) || isnan(y)) ? x : y
end

38.7 μs

display_digit (generic function with 1 method)

xxxxxxxxxx
 
function display_digit(img)
    Gray.(permutedims(img, (2, 1)))
end

26.8 μs